Conversational Engagement Recognition Using Auditory and Visual Cues
نویسندگان
چکیده
Automatic prediction of engagement in human-human and human-machine dyadic and multiparty interaction scenarios could greatly aid in evaluation of the success of communication. A corpus of eight face-to-face dyadic casual conversations was recorded and used as the basis for an engagement study, which examined the effectiveness of several methods of engagement level recognition. A convolutional neural network based analysis was seen to be the most effective.
منابع مشابه
Speaker Dependency Analysis, Audiovisual Fusion Cues and a Multimodal BLSTM for Conversational Engagement Recognition
Conversational engagement is a multimodal phenomenon and an essential cue to assess both human-human and human-robot communication. Speaker-dependent and speaker-independent scenarios were addressed in our engagement study. Handcrafted audio-visual features were used. Fixed window sizes for feature fusion method were analysed. Novel dynamic window size selection and multimodal bi-directional lo...
متن کاملTowards Context-Based Visual Feedback Recognition for Embodied Agents
Head pose and gesture offer several key conversational grounding cues and are used extensively in face-to-face interaction among people. We investigate how contextual information can improve visual recognition of feedback gestures during interactions with embodied conversational agents. We present a visual recognition model that integrates cues from the spoken dialogue of an embodied agent with...
متن کاملUsing Eye Movement Analysis to Study Auditory Effects on Visual Memory Recall
Recent studies in affective computing are focused on sensing human cognitive context using biosignals. In this study, electrooculography (EOG) was utilized to investigate memory recall accessibility via eye movement patterns. 12 subjects were participated in our experiment wherein pictures from four categories were presented. Each category contained nine pictures of which three were presented t...
متن کاملIs it Possible to Evaluate the Contribution of Visual Information to the Process of Speech Comprehension?
We report in this paper the results of a series of comprehension tests run with the aim of investigating the contribution of visual information to the process of comprehension of conversational speech. The methodology we designed was presented in a previous work [1] in which we also showed the results of a pilot test to confirm our original hypothesis that the comprehension of conversational sp...
متن کاملAuditory and auditory-visual recognition of clear and conversational speech by older adults.
Research has shown that speech articulated in a clear manner is easier to understand than conversationally spoken speech in both the auditory-only (A-only) and auditory-visual (AV) domains. Because this research has been conducted using younger adults, it is unknown whether age-related changes in auditory and/or visual processing affect older adults' ability to benefit when a talker speaks clea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016